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Precopulatory courtship is a high-cost, non-well understood animal 
world mystery. Drosophila's {—D. 's) precopulatory courtship not only 
shows meirked structural similcirities with mammalian courtship, but 
also with human spoken language. This suggests the study of pur- 
pose, modalities and in particular of the power of this language and 
to compcire it to human language. Following a mathematical sym- 
bolic dynamics approach, we translate courtship videos of £).'s body 
language into a formal language. This approach made it possible 
to show that D. may use its body language to express individual 
information - information that may be important for evolutionary 
optimization, on top of the sexual group membership. Here, we use 
Chomsky's hierarchical language classification to chciracterize the 
power of £).'s body language, and then compare it with the power 
of languages spoken by humans. We find that from a formal lan- 
guage point of view, i^.'s body language is at least as powerful as the 
languages spoken by humans. From this we conclude that human 



intellect cannot be the direct consequence of the formal grammar 
complexity of human language. 

Introduction 

Over the centuries, the evolution of human language has been the subject of controversial 
discussions among philosophers, linguists and biologists. Yet, a consensus on what causes 
language to evolve and what are the effects of this on society has not been achieved. Tra- 
ditionally, language was thought of as a strictly culturally transmitted phenomenon, with 
few or no biological ties at all. In the second half of the 20th century, under Chomsky's 
influence who considered that language is located in the brain and therefore is subject 
to biological conditions PQ this view started to change. The discussion arose from 
what the driving force of the evolution of language could be. An important observation 
is that language - as is any complex ability of humans or animals - is the result of natural 
selection [3]. Chomsky and his school remained, however, skeptical about approaches 
that saw this as the only driving force. They suggested that language grammar may 
have emerged as a side-effect of the reorganization of the brain needed for coping with its 
growing size during evolution towards the modern Homo sapiens [U [5] . In order to study 
the evolution of language and to determine its driving forces, a classiflcation of languages 
accounting for the changes undergone would be helpful. To capture the grammatical 
complexity aspect of languages, Chomsky and Schiitzenberger [6] proposed a hierarchical 
classiflcation scheme, comprising grammars of increasing grammatical complexities: type 
t-3 (left regular grammar) C type t-2 (context free grammar) C type t-1 (context sensitive 
grammar) C type t-0 (Turing machine). This classiflcation has proven extremely useful in 
different flelds of comparative sciences. It has been used e.g. to compare spoken human 
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languages, for distinguishing compiler languages, as a basis for the theory of automata, 
and for classifying dynamical systems. A natural question is whether the more advanced 
organizational forms are generally equipped with more complex language structures? Our 
study will answer a more specific - but similarly fundamental - question: Do more com- 
plex organizations (society, intelligence,..) require a language representation of increased 
grammatical complexity? 

To answer this question, we compare the grammatical complexity of human languages 
(which are known to fall mostly into Chomsky hierarchy type t-2 [ZIIH]) with experimental 
data from the precopulatory courtship body language of the fruit fly Drosophila. In 
the animal world, courtship ranges from simple rituals to complex communication-like 
behaviors. Despite its high cost for the animal (energy- and death toll-wise), the origins 
and purpose of courtship are still not well understood. A natural hypothesis is that 
courtship is an evolutionary optimization mechanism that a species may or may not take 
advantage of. Living in a simple and evolutionarily fast environment, D. provides a well- 
suited testing case. Until recently, investigations on this nature of D.'s courtship were 
hampered by the lack of a conceptual framework able to address this question. Behavior 
is characterized by rituals that consist of well-chosen sequences of individual actions. 
Since it is in the nature of these rituals that they need to be repeated if required, we 
characterize behavior by sequences of indecomposable closed cycles of indecomposable 
individual actions, so-called irreducible cycles of irreducible acts PUHj. This approach is 
also motivated by the theory of complex dynamical systems, where it has been shown that 
such systems can be reduced to a minimal set of closed sequences of actions (there called 
'irreducible closed orbits'). From this set the system can systematically be approximated 
by combining ever more of these sequences, starting with the shortest ones (for detailed 
references cf. Ref. [in])- Using a decomposition of such data into irreducible cycles, it 
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has been found that with high confidence during D.'s precopulatory courtship, individual 
information is transmitted to the prospective partner, i.e. a real communication with 
essential information exchange takes place between the partners jH |T0] (supplementary 
material's Fig. 2). 

The focus of the present work is on the power of the grammar that underlies the 
generation of the courtship language. To the best of our knowledge we use here for the first 
time Chomsky's classification scheme to characterize courtship and animal body language. 
Although the question by what grammar a given experimental data was generated is in 
its narrower sense undecidable [TT], we are able to provide an answer in the statistical 
sense: Namely, we show that it is very unlikely that D.'s body language is generated 
by grammars of complexity lower than those of human languages. For some data we 
find indications of a type t-1 grammar underlying their generation, which reaches beyond 
the grammatical complexity of human language. An overview of the experimental and 
computational procedures is presented in Fig. 1. 

The data that we use in this study originates from experiments where the courtship 
behavior of a pair of fruit files is recorded in an observation chamber at fixed environ- 
mental conditions of 25 °C and 75% humidity. From high-speed camera recordings of 30 
frames per second, we isolated 37 fundamental behavioral acts and coded the recordings 
accordingly |9] (supplementary material's Fig. 1). Fundamental acts are body movements 
that can freely be combined with each other. Besides pairing single normal females in 
the immature, mature and mated states with single normal males, additionally fruitless 
mutant males [9] were paired with either mature females or with mature normal males, 
leading to five types of experiments. Since either of the protagonists gives rise to a time 
series, ten classes of experimental time series were obtained in this way. Tagging each 
fundamental act by an integer number, each camera episode is represented by a string 
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or time series of these symbols. A mature female as the protagonist in the presence of a 
normal male, e.g., generates in this way a time series as 

uj = {9, 17, 21, 20, 17, 20, 6, 21, 6, 21, 17, 18, 21, 25, 20, 17, 20, 21, 17, 18, 21, 17, 20, 9, 17, 20, 21, 
20, 21, 17, 21, 17, 18, 21, 17, 21, 20, 24, 17, 18, 20, 21, 17, 21, 20, 17}. 



Statistical generative grammar model 

The simplest grammatical model for the putative generation of the experimental time 
series is a a grammar of type t-3 of the Chomsky hierarchy of languages. This model is 
equivalent to a random walk on the given set of symbols with probabilities given by the 
symbol frequencies observed in the respective experiments, but with no further restrictions 
imposed. If D.'s body language is of low complexity, the observed strings should fit well 
into the random walk model. From simulating the random walk based on the observed 
symbol probabilities of each experiment, we obtained from each experimental file a set 
of surrogate files to compare with (see Fig. lA; throughout our investigations, we use 
Nsim — 100 simulated random walks). For the comparison, a figure of merit is used. 
Every time series ui — {xq, Xi, xl\ is characterized by products along the string of the 
probabilities Pin{x) - measuring that a random walk starting at Xq ends at point x - with 
Pout{x) measuring the probability that a random walk starting at x reaches point xl- 
For the unrestricted random walk, these probabilities are 



Pin{x) = — ^ f-Pl ••••Pn 

(AT „\| „(JVl-ni) J^'^^symb-'^^symb) 



where n is the number of steps needed to reach point x, producing rij repetitions of the 
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symbol tagged with index j. 

The entropy Hthrough associated with a string reahzation is based on the local walk- 
through probability Pthrough '■— Pin " Pouti evaluated along the string, as 



n-throughy^) '■— j — — y / ^^^ZK^^throughK^i)) —'■ J / ^ J^throughyXj) ■, 



1=1 1=1 



with Xi — {n{,n\, ■■■j^n^j^^J the coordinate of point Xi & uj in the symbol space. In the 
figures, Hthrough will always be displayed as H, see Fig. IB. 



Courtship language classification 

We evaluated Hthrough{i^) for each experiment and for the corresponding random walks. 
For the latter, we also determined the mean values and the standard deviations, see 
Fig. IC. Whereas the t-3 model generates strings with similar Hthrough{x) characteristics 
for approximately one third of the experimental data for the remaining two thirds, this 
description fails. A t-3 example is given in Fig. IB, left panel; an example where the 
t-3 model fails is shown in Fig. IB, right panel. In the latter cases, the experimental 
Hthroughi^) dramatically differs from those obtained for the t-3 model: The experiment's 
clear peak around position 170 is very unlikely to be reproduced by a simple random walk. 
The pyramid-like shape with its clear maximum of Hthrough suggests that in the data, an 
eminent change has occurred in the way of how symbols are chosen from the alphabet. 

To proceed with those experiments that do not fit into a t-3 model, we apply a recursive 
approach ('t-3, t-2, t-1 model'). We split a string at the point of maximum Hthroughi^), 
and model the partial strings oui, separately by corresponding random walks. Strings 
of the form uj = (Ji(J2 are generated from a t-2 (i.e. context-free) grammar, since a 
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word UJ = a^b^, n G N cannot be created by a t-3 grammar, t-2 grammars reproduce 
the characteristics of five of our experiments, they remain, however, to be inappropriate 
for about half of the data. The obvious solution then is to expand the latter into ever 
more partial walks. Technically, for each file we simulate a set of Ngim random walks. 
On this set, we calculate Hthroughij^) i their average and their standard deviation. If the 
original file's Hthroughi'^) falls within a standard deviation from the computed average, the 
random walk describes the string well and the string is considered to be t-3. Otherwise, 
by splitting the string oj at the maximum of Hthrough{x), we obtain ui and U2. For 
these partial strings, random walks are then performed separately, and compared to the 
original data. If they are close enough in the above sense, we consider the string to be 
t-2. Otherwise we proceed recursively, which implies context-sensitivity [12] and therefore 
a t-1 grammar. An example of our iterative t-3, t-2, t-1 procedure is given in Fig. 2A. 
If we compare Hthrough of surrogate walks generated according to need by t-3, t-2, and 
t-1 constructions (green points in Fig. 2B), we see that the obtained values are hardly 
distinguishable from the experimental data. 

A natural question is whether the obtained results could in a simpler way be generated 
by a sequence of type t-3 grammars. In order to investigate this possibility, we checked 
the abundance of irreducible cycles that provide our mathematical basis for capturing 
behavior |9l HUj- For a succession of type t-3 grammars, their number should not differ 
in an essential way from the number obtained by simple type t-3 random walks. We 
observed, however, a massive increase of the irreducible closed cycles from files that we 
classified as type-2 or type-1, as is exhibited in Fig. 3A. This result corroborates the 
expectation that an increase of the number of closed cycles could serve as the hallmark 
of higher grammars [10] and provides a further argument for the abundant use of higher 
grammars in D.'s body language. 
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Conclusion and discussion 



The comparison between the behavior of all observed female flies and all observed normal 
males uncovers that whereas female flies tend to use type t-3 or t-2 grammars, normal 
males prefer type t-1 (Fig. 3B). This provides a novel insight into the role of the courtship 
protagonists depending on their sexual group from the grammatical perspective. More 
fundamentally, we stress that in view of the presented results, D.'s precopulatory body 
language is not the result of the simplest grammar type t-3 (i.e., a random walk on 
states of a finite automaton). There is a general agreement that natural human languages 
fall mostly into the type t-2 of Chomsky's characterization (with among the European 
languages the Swiss-German and the Dutch showing the highest degree of grammatical 
complexity [8]). On the basis of our analysis one can safely say that the D.'s body language 
is of no lesser grammatical complexity than the spoken language of humans. 

From our findings one also has to conclude that aspects of spoken language that we 
often take as given are not reflected in the grammatical complexity. In particular it is not 
possible to conclude from language complexity the developmental level / intelligence of 
an organism. More complex worlds seem not to require more complex grammars. 

The supremacy of human intellect can thus not be founded in the formal grammatical 
complexity of the language being used. It emerges, rather surprisingly, that lower level 
species have recursive elements too (recursion is often the key argument for distinguishing 
between t-3 and t-2 grammars, see Ref. [E]). It appears, however, that only humans have 
acquired a kind of awareness of theses structures and have learnt to purposefully use them. 

This work was supported by the Swiss National Science Foundation SNF grant 200021- 
122276 to R.S. 
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experimental string t-3 random walk simulations 




10 ^ 40 



Figure 1: A) We compare the experimental symbol strings with strings of equal length 
generated from a t-3 random walk based on the observed symbol probabilities. B) For each 
string (observed and simulated), Hthrough{x) is calculated. Thick red lines: experiments, 
thin lines: t-3 random walks. C) Hthroughiy) calculated across the data set wraps up the 
results: Red dots: Experiments; blue dots: Mean values of Nsim = 100 t-3 random walks; 
bars: one standard deviation. For two thirds of all files, the t-3 model fails. 
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Figure 2: A) Experiment E=23 requires a t-1 random walk. B) Hthroughi'^) for each 
experiment and its surrogate set. Red dots: experiments. Blue: dots: mean values of 
Nsim = 100 t-3 random walks; bars: one standard deviation. Green: mean values of 
Nsim = 100 t-3, t-2, t-1 random walks; bars: one standard deviation. Blue dots and bars 
of t-3 files are obscured by red and green dots and green bars. One can clearly see that 
the green dots approximate the experimental red dots much better than the blue dots. 
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Figure 3: A) Histogram of the cumulative number of closed cycles found in a) t-3 random 
walks, b) in t-3, t-2, t-1 random walks according to the data's classification, across all 
data files. The experimental data ("exp") with 468 cycles fits well only into the t-3, 
t-2, t-1 model. Histograms are based on 100 simulations for all experimental files |13j . 
B) Distribution of t-3, t-2, t-1 classes (with absolute numbers indicated): a) across all 
experiments, b) across all experiments with females, c) across all experiments with normal 
males. 
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